A perceptually balanced loss function for short-time spectral amplitude estimation

نویسندگان

  • Patrick J. Wolfe
  • Simon J. Godsill
چکیده

Here we present a novel approach to audio signal enhancement based on psychoacoustic principles. Specifically, we describe a short-time spectral amplitude estimator whose form comprises a weighted sum of the minimum mean-square error solution and the observed spectral value, where the weighting factor is given by the ratio of the masked threshold and this observed value. We then explore the connection between our approach and the idea of socalled balanced loss functions in statistics, showing the former to be an instance of the latter with a very special choice of weighting factor. Lastly, we present results indicating the relative merits of our approach in both objective and subjective terms, as compared to standard minimum mean-square error estimation under the assumed model. Software and sound examples are available at http://www-sigproc.eng.cam.ac.uk/ pjw47.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient β-order Perceptually Motivated Spectral Amplitude Bayesian Estimator Based On Chi-distribution for Speech Enhancement

The traditional Bayesian estimator of short-time spectral amplitude is based on the minimization of the squared-error cost function under the common Gaussian probability density function (pdf). The Gaussian distribution, however, is not the optimal probability distribution. To overcome this phenomenon, we considered to replace the traditional distribution hypothesis of spectral amplitude of spe...

متن کامل

Estimation of Scale Parameter in a Subfamily of Exponential Family with Weighted Balanced Loss Function

Suppose x1,x2, x3, ..., xn is a random sample of size n  from a distribution with pdf...[To continue please click here]

متن کامل

Towards a perceptually optimal spectral amplitude estimator for audio signal enhancement

We present a statistical model-based approach to signal enhancement in the case of additive broadband noise. Because broadband noise is localised in neither time nor frequency, its removal is one of the most pervasive and difficult signal enhancement tasks. In order to improve perceived signal quality, we take advantage of human perception and define a best estimate of the original signal in te...

متن کامل

Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator

In this paper, we introduce a generalized minimum meansquare error short-time spectral amplitude estimator with a new prior estimation of the speech probability density function based on momentcumulant transformation. From the objective and subjective evaluation experiments, we show the improved noise reduction performance of the proposed method. key words: generalized MMSE STSA estimator, spee...

متن کامل

Distributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude

In this study, the authors propose multichannel weighted Euclidean (WE) and weighted cosh (WCOSH) cost function estimators for speech enhancement in the distributed microphone scenario. The goal of the work is to illustrate the advantages of utilising additional microphones and modified cost functions for improving signal-to-noise ratio (SNR) and segmental SNR (SSNR) along with log-likelihood r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003